Abstract. We study various optimality criteria for quantum observables. Observables are
represented as covariant positive operator valued measures and we consider the case when the 
symmetry group is compact. Phase observables are examined as an example. 

1. Introduction 

In this work we study four optimality criteria applied to covariant quantum observables.
Quantum observables are described as normalized positive operator valued measures. Covariance
means that observables transform in a consistent way under a group operation. For
example, phase observables are defined as the normalized positive operator measures on a circle
which are covariant under phase shifts.

A quantum observable can be optimal in various ways. Optimality can be defined in at least
four different ways. Namely, an observable can be

(a) (approximately) sharp 

(b) extremal 

(c) postprocessing clean 

(d) preprocessing clean 

If an observable is (approximately) sharp, then for any neighborhood of a point in the set of
possible measurement outcomes, one can prepare a state such that the probability of getting a
result from the neighborhood is (approximately) 1. This reflects the fact that an approximately
sharp observable has no intrinsic unsharpness: the spread in the measurement outcome
distribution depends solely on the measured state.

A covariant observable is extremal in the convex set of all covariant observables if it cannot 
be represented as a nontrivial convex combination of other covariant observables. A convex 
mixture of observables corresponds to a random choice between measurement apparatuses, and 
thus an extremal observable is unaffected by this kind of randomness. 

After a measurement of an observable, one may try to process the obtained measurement
outcome data to get the measurement outcome data of another observable. An observable is
called postprocessing clean if it cannot be obtained by manipulating the measurement outcome
data of another observable.

Analogously, a preprocessing clean observable cannot be obtained by manipulating the state
before the measurement and then measuring some other observable. State manipulation is
described by a quantum channel, which typically loses some information. Hence, a preprocessing
clean observable is not irreversibly connected to another observable by a channel.

In Section 2 we formulate the optimality criteria and derive some general results in the case
of a compact symmetry group. In Section 3 we focus on the phase observables. The phase
observables form an interesting class as there is no sharp phase observable (i.e. projection
valued measure). This class is also a rich example for the general investigation of optimal
observables, showing various connections and differences between the four optimality criteria.

2. Optimal covariant observables 

2.1. Basic definitions. In quantum mechanics, observables are represented as normalized
positive operator valued measures. We briefly recall some relevant basic concepts. For more
details, we refer to [2], [6], [13].

Let $\mathcal{H}$ be a complex separable Hilbert space. We denote by $\mathcal{L}(\mathcal{H})$ the set of bounded operators
and by $\mathcal{T}(\mathcal{H})$ the set of trace-class operators on $\mathcal{H}$. We denote by $O$ and $I$ the zero and identity
operators on $\mathcal{H}$, respectively. Let $\Omega$ be a topological space. We denote by $\mathcal{B}(\Omega)$ the Borel
$\sigma$-algebra on $\Omega$.

Definition 1. A set function $E : \mathcal{B}(\Omega) \to \mathcal{L}(\mathcal{H})$ is an observable if it satisfies the following
conditions:

• $O \le E(X) \le I$ for any $X \in \mathcal{B}(\Omega)$;

• $E(\Omega) = I$;

• $E(\cup_i X_i) = \sum_i E(X_i)$ for any sequence $\{X_i\}$ of disjoint Borel sets, the sum converging
in the weak operator topology.

The set $\Omega$ is called the outcome set of $E$.

Two observables $E$ and $F$ are called equivalent, and denoted $E \sim F$, if there exists a unitary
operator $W \in \mathcal{L}(\mathcal{H})$ such that $F(X) = W E(X) W^*$ for all $X \in \mathcal{B}(\Omega)$.

States of a quantum system are described by (and identified with) positive trace class operators
of trace 1. We denote by $\mathcal{S}(\mathcal{H})$ the convex set of all states. If $E$ is an observable and $\varrho$ is
a state, then the trace formula

$$p^E_\varrho(X) := \mathrm{tr}[\varrho E(X)], \quad X \in \mathcal{B}(\Omega),$$

defines a probability measure on $\mathcal{B}(\Omega)$. The number $p^E_\varrho(X)$ is interpreted as the probability
of getting a measurement outcome $x$ belonging to $X$, when the system is in the state $\varrho$ and the
observable $E$ is measured.
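
As a minimal numerical sketch of the trace formula (not taken from the paper), consider a three-outcome qubit observable, often called the trine, whose effects are $\frac{2}{3}|v_k\rangle\langle v_k|$ for three real unit vectors $v_k$. The finite outcome set stands in for the general Borel setting, and the names `effect` and `probability` are illustrative assumptions.

```python
import math

def effect(k):
    """k-th effect (2/3)|v_k><v_k| with v_k = (cos(pi k/3), sin(pi k/3)), k = 0, 1, 2."""
    c, s = math.cos(math.pi * k / 3), math.sin(math.pi * k / 3)
    return [[2 * c * c / 3, 2 * c * s / 3],
            [2 * c * s / 3, 2 * s * s / 3]]

def probability(rho, e):
    """Trace formula p(k) = tr[rho E_k] for 2x2 real matrices."""
    return sum(rho[i][j] * e[j][i] for i in range(2) for j in range(2))

rho = [[1.0, 0.0], [0.0, 0.0]]  # the pure state |0><0|
probs = [probability(rho, effect(k)) for k in range(3)]

# Normalization E(Omega) = I: the three effects sum to the identity.
total = [[sum(effect(k)[i][j] for k in range(3)) for j in range(2)] for i in range(2)]
```

The list `probs` is a probability distribution on the three outcomes because the effects are positive and sum to the identity, which is exactly what Definition 1 requires.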

There is also an equivalent description of observables which we need later. Namely, an
observable $E$ determines a mapping $\Theta_E$ from the set of states $\mathcal{S}(\mathcal{H})$ into the set of probability
measures $M_1^+(\Omega)$,

$$\varrho \mapsto \Theta_E(\varrho) := p^E_\varrho.$$

The mapping $\Theta_E$ is affine, i.e., it maps convex combinations of states to convex combinations
of corresponding probability measures.

In this paper we study observables which have a specific symmetry property, called covariance.
To formulate this concept, let $G$ be a topological group. For simplicity, here we consider only
the case where $G$ is the symmetry group and also the outcome space. (This corresponds to the
situation when the symmetry group acts transitively on the outcome space $\Omega$ and the stability
subgroup is the trivial one element group consisting of the identity element $e$ only.) By a
unitary representation of $G$ we mean a strongly continuous group homomorphism from $G$ to
the group $\mathcal{U}(\mathcal{H})$ of unitary operators in a Hilbert space $\mathcal{H}$.

Definition 2. Let $U$ be a unitary representation of $G$ in a Hilbert space $\mathcal{H}$. An observable
$E : \mathcal{B}(G) \to \mathcal{L}(\mathcal{H})$ is covariant with respect to $U$ (or $U$-covariant, for short) if

$$U(g) E(X) U(g)^* = E(gX)$$

for all $g \in G$ and $X \in \mathcal{B}(G)$.

Two $U$-covariant observables $E$ and $F$ are called $U$-equivalent, and denoted $E \sim_U F$, if there
exists a unitary operator $W \in \mathcal{L}(\mathcal{H})$ such that $U(g)W = WU(g)$ for all $g \in G$ and $F(X) =
W E(X) W^*$ for all $X \in \mathcal{B}(G)$.

Covariance of observables arises in many different situations; we refer to [2], [11] for many
examples. Trivially, for two $U$-covariant observables $E$ and $F$, the equivalence $E \sim_U F$ implies
that $E \sim F$ but, as we will show later (see Proposition 10), $E \sim F$ does not necessarily imply
that $E \sim_U F$.

In the rest of this paper, $G$ is a compact topological group which is Hausdorff and satisfies
the second axiom of countability. We denote by $\mu$ the Haar measure of $G$, normalized so that
$\mu(G) = 1$. We fix a unitary representation $U$ of $G$, and the set of all $U$-covariant observables is
denoted by $\mathcal{O}$.

2.2. Approximately sharp observables. Suppose for a moment that $E : \mathcal{B}(G) \to \mathcal{L}(\mathcal{H})$ is
a projection valued observable, also called a sharp observable. Then for any $X \in \mathcal{B}(G)$ with
$E(X) \neq O$, we can choose a (pure) state $\varrho \in \mathcal{S}(\mathcal{H})$ such that $\mathrm{tr}[\varrho E(X)] = 1$. This means that
we can prepare the system in a state $\varrho$ such that the probability of getting an outcome $x \in X$
is 1.

This property of sharp observables is a simple consequence of the fact that each non-zero
operator $E(X)$ has eigenvalue 1. In particular, $\|E(X)\| = 1$. This gives a useful generalization of
sharp observables [^.[TB]. 

Definition 3. An observable $E : \mathcal{B}(G) \to \mathcal{L}(\mathcal{H})$ is approximately sharp if $\|E(X)\| = 1$ for every
open set $X \subseteq G$ such that $E(X) \neq O$.

Let us first note that the condition $\|E(X)\| = 1$ does not imply that $\mathrm{tr}[\varrho E(X)] = 1$ for some
state $\varrho$. Indeed, $\|E(X)\| = 1$ means that 1 is in the spectrum of $E(X)$ but 1 need not be an
eigenvalue. The condition $\|E(X)\| = 1$ is equivalent to the existence of a sequence
$\{\varrho_n\}_{n \in \mathbb{N}}$ of (pure) states such that

$$\lim_{n \to \infty} \mathrm{tr}[\varrho_n E(X)] = 1.$$

Therefore, approximate sharpness means that we can prepare the system in a state $\varrho_n$ such
that the probability of getting an outcome from $X$ is arbitrarily close to 1.

Now, assume that $E$ is a $U$-covariant observable. Then $E(X) = O$ if and only if $\mu(X) = 0$;
see [m. On the other hand, $\mu(X) > 0$ for any (non-empty) open set $X \subseteq G$. Thus, $E$ is
approximately sharp if $\|E(X)\| = 1$ for every open set $X \subseteq G$, $X \neq \emptyset$.

Proposition 1. Let $E, F \in \mathcal{O}$ and $E \sim F$. If $E$ is approximately sharp, then so is $F$.

Proof. Since $E \sim F$, there is a unitary operator $W$ such that $F(X) = W E(X) W^*$ for every
$X \in \mathcal{B}(G)$. This implies that the operator norms of $E(X)$ and $F(X)$ are the same, i.e., $\|F(X)\| =
\|W E(X) W^*\| = \|E(X)\|$. □

Until now, it may seem that the concept of an approximately sharp observable is quite
artificial. The usefulness and importance of this concept become evident in situations where
there are no sharp observables in the set $\mathcal{O}$, but there exist approximately sharp observables
in $\mathcal{O}$. This is the case for phase observables, which we will study in Section 3.

2.3. Extremal observables. The set $\mathcal{O}$ of all $U$-covariant observables is convex, and we denote
by $\mathrm{Ext}\,\mathcal{O}$ the set of extremal elements of $\mathcal{O}$. A convex mixture of two observables corresponds to
a random choice or fluctuation between two measurement apparatuses. An extremal observable
thus describes an observable which is unaffected by this kind of randomness.

In our earlier article [3] we have characterized the extremal observables in $\mathcal{O}$. We do not
reproduce this characterization here but we use it in the special case of phase observables in
Section 3.5. Extremality and approximate sharpness have the following simple connection.

Proposition 2. Let $E$, $E_1$, $E_2$ be three observables and assume that $E$ is a convex combination
of $E_1$ and $E_2$. If $E$ is approximately sharp, then $E_1$ and $E_2$ are also approximately sharp.

Proof. Let $X \subseteq G$ be an open set. As $E$ is a convex combination of $E_1$ and $E_2$, we have
$E(X) = \lambda E_1(X) + (1 - \lambda) E_2(X)$ for some $0 < \lambda < 1$. By the triangle inequality, we get

$$\|E(X)\| \le \lambda \|E_1(X)\| + (1 - \lambda) \|E_2(X)\| \le 1.$$

Thus, if $\|E(X)\| = 1$, then $\|E_1(X)\| = \|E_2(X)\| = 1$. □

2.4. Postprocessing. Let $M_1^+(G)$ be the convex set of probability measures on $\mathcal{B}(G)$. As
discussed earlier, each observable $E$ defined on $\mathcal{B}(G)$ determines an affine mapping $\Theta_E$ from
$\mathcal{S}(\mathcal{H})$ to $M_1^+(G)$.

Definition 4. Let $E, F \in \mathcal{O}$. We say that $F$ is a postprocessing of $E$, and denote $F \preceq_{\mathrm{post}} E$, if
there exists an affine mapping $\Psi : M_1^+(G) \to M_1^+(G)$ such that

(1)  $\Theta_F = \Psi \circ \Theta_E.$

The relation $\preceq_{\mathrm{post}}$ on $\mathcal{O}$ is reflexive and transitive, thus a preorder. We denote by $\sim_{\mathrm{post}}$ the
induced equivalence relation, i.e., $E \sim_{\mathrm{post}} F$ if and only if $F \preceq_{\mathrm{post}} E \preceq_{\mathrm{post}} F$.

Definition 5. An observable $E \in \mathcal{O}$ is postprocessing clean in $\mathcal{O}$ if for every $F \in \mathcal{O}$, the
following implication holds:

(2)  $E \preceq_{\mathrm{post}} F \;\Rightarrow\; E \sim_{\mathrm{post}} F.$

The postprocessing relation and postprocessing cleanness have been studied in [1], [6], [8], [15],
[21]. Some connections between the convex structure of the set of observables and the
postprocessing relation were proved in [2]. In the following we derive some results for the
postprocessing relation in the case of covariant observables.

For each probability measure $\nu$ on $G$, we define a mapping $\Psi_\nu : M_1^+(G) \to M_1^+(G)$ by

(3)  $\Psi_\nu(p) = \nu * p.$

Here $\nu * p$ denotes the convolution of these measures, i.e.,

$$(\nu * p)(X) = \int_G \nu(g^{-1}X)\, dp(g), \quad X \in \mathcal{B}(G).$$

For $E \in \mathcal{O}$ and $\nu \in M_1^+(G)$, we denote by $E_\nu$ the observable corresponding to the affine mapping
$\Psi_\nu \circ \Theta_E$. In other words, $E_\nu$ is the observable defined by the formula

(4)  $E_\nu(X) = \int_G \nu(g^{-1}X)\, dE(g), \quad X \in \mathcal{B}(G).$

By its definition, $E_\nu$ is a postprocessing of $E$. It is straightforward to check that the $U$-covariance
of $E$ implies that $E_\nu$ is also $U$-covariant.
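
The convolution formula can be illustrated on a finite group. The following sketch assumes the cyclic group $\mathbb{Z}_n$ (written additively) in place of a general compact group, so that measures are lists of point masses and the integral becomes a finite sum; the function name `convolve` is an illustrative choice.

```python
def convolve(nu, p):
    """(nu * p)({x}) = sum_g nu({x - g}) p({g}) on the cyclic group Z_n."""
    n = len(p)
    return [sum(nu[(x - g) % n] * p[g] for g in range(n)) for x in range(n)]

n = 6
p = [0.5, 0.3, 0.2, 0.0, 0.0, 0.0]      # an outcome distribution on Z_6

dirac = [0.0] * n
dirac[2] = 1.0                           # Dirac measure at the group element 2
shifted = convolve(dirac, p)             # p translated by 2

smear = [0.5, 0.5, 0.0, 0.0, 0.0, 0.0]  # a non-Dirac probability measure
blurred = convolve(smear, p)             # a genuinely smeared distribution
```

Convolving with a Dirac measure only relabels the outcomes, whereas any other $\nu$ mixes translated copies of the distribution. This is the distinction that the propositions of this subsection make precise.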

Proposition 3. Let $E, F \in \mathcal{O}$. The following conditions are equivalent:

(i) $F \preceq_{\mathrm{post}} E$.

(ii) There is a probability measure $\nu$ such that $F = E_\nu$.

Proof. As explained previously, (ii) implies (i). 

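Conversely, suppose $\Psi : M_1^+(G) \to M_1^+(G)$ is an affine mapping such that Eq. (1) holds. The
mapping $\Psi$ extends uniquely to a bounded linear mapping $\widetilde{\Psi} : M(G) \to M(G)$, where $M(G)$
is the Banach space of complex Borel measures on $G$, by setting

$$\widetilde{\Psi}(m) = \|m_{1+}\|\, \Psi\!\left(\frac{m_{1+}}{\|m_{1+}\|}\right) - \|m_{1-}\|\, \Psi\!\left(\frac{m_{1-}}{\|m_{1-}\|}\right) + i\|m_{2+}\|\, \Psi\!\left(\frac{m_{2+}}{\|m_{2+}\|}\right) - i\|m_{2-}\|\, \Psi\!\left(\frac{m_{2-}}{\|m_{2-}\|}\right).$$

Here $m = m_{1+} - m_{1-} + i m_{2+} - i m_{2-}$ is the Jordan decomposition of the real and imaginary
parts of a measure $m$, and $\|m\|$ is the total variation norm of $m \in M(G)$ (in the above formula,
$0/0 = 0$ is assumed).

For each $g \in G$, we define the following linear isometric isomorphism $A_g : M(G) \to M(G)$:

$$\int f(h)\, d(A_g m)(h) := \int f(gh)\, dm(h) \quad \forall f \in C(G).$$

The mapping $g \mapsto A_g m$ is weak*-continuous, and it is norm continuous if $m \in L^1(G)$. In this
case, the mapping $g \mapsto A_g \widetilde{\Psi} A_{g^{-1}} m$ is weak*-continuous. In fact, if $f \in C(G)$, then

$$\begin{aligned}
&\left| \int f(x)\, d(A_g \widetilde{\Psi} A_{g^{-1}} m)(x) - \int f(x)\, d(A_h \widetilde{\Psi} A_{h^{-1}} m)(x) \right| \\
&\quad \le \left| \int f(gx)\, d(\widetilde{\Psi} A_{g^{-1}} m - \widetilde{\Psi} A_{h^{-1}} m)(x) \right| + \left| \int [f(gx) - f(hx)]\, d(\widetilde{\Psi} A_{h^{-1}} m)(x) \right| \\
&\quad \le \|f\|_\infty\, \|\widetilde{\Psi} A_{g^{-1}} m - \widetilde{\Psi} A_{h^{-1}} m\|_M + \|f(g\,\cdot) - f(h\,\cdot)\|_\infty\, \|\widetilde{\Psi} A_{h^{-1}} m\|_M \\
&\quad \le \|f\|_\infty\, \|\widetilde{\Psi}\|\, \|A_{g^{-1}} m - A_{h^{-1}} m\|_M + \|f(g\,\cdot) - f(h\,\cdot)\|_\infty\, \|\widetilde{\Psi}\|\, \|m\|_M,
\end{aligned}$$

and the mappings $g \mapsto A_{g^{-1}} m$ and $g \mapsto f(g\,\cdot)$ are both norm continuous.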

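If $m \in L^1(G)$, then for all $f \in C(G)$ the mapping $g \mapsto \int f(x)\, d(A_g \widetilde{\Psi} A_{g^{-1}} m)(x)$ is
integrable, and we have

$$\left| \int_G \left[ \int f(x)\, d(A_g \widetilde{\Psi} A_{g^{-1}} m)(x) \right] dg \right| \le \|f\|_\infty\, \|\widetilde{\Psi}\|\, \|m\|_M.$$

This shows that there exists a measure $\overline{\Psi} m \in M(G)$ such that

$$\int_G \left[ \int f(x)\, d(A_g \widetilde{\Psi} A_{g^{-1}} m)(x) \right] dg = \int f(x)\, d(\overline{\Psi} m)(x) \quad \forall f \in C(G),$$

and the linear mapping $m \mapsto \overline{\Psi} m$ is bounded from $L^1(G)$ into $M(G)$.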

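Suppose $m \in L^1(G) \cap M_1^+(G)$. If $\{f_n\}_{n \in \mathbb{N}} \subseteq C(G)$ is such that $f_n \ge 0$ and $f_n \uparrow 1$, then
$\int f_n(x)\, d(A_g \widetilde{\Psi} A_{g^{-1}} m)(x) \uparrow 1$ since $A_g \widetilde{\Psi} A_{g^{-1}} m \in M_1^+(G)$. Therefore, $\int f_n(x)\, d(\overline{\Psi} m)(x) \uparrow 1$ by
the dominated convergence theorem, thus showing that $\overline{\Psi} m \in M_1^+(G)$.

Clearly, $A_g \overline{\Psi} m = \overline{\Psi} A_g m$ for all $m \in L^1(G)$. In particular, the map $g \mapsto A_g \overline{\Psi} m$ is continuous,
hence $\overline{\Psi} m \in L^1(G)$ by Theorem 1.6 in [23] (which is unaltered even if $G$ is not Abelian). By a
result of Wendel there exists $\nu \in M_1^+(G)$ such that $\overline{\Psi}(\phi) = \nu * \phi$ for all $\phi \in L^1(G)$.

Since $\Theta_E(\mathcal{S}(\mathcal{H})), \Theta_F(\mathcal{S}(\mathcal{H})) \subseteq L^1(G)$ (see e.g. [9]), and $A_g \Theta_F = \widetilde{\Psi} A_g \Theta_E$ by covariance of $E$
and $F$, we have $\Theta_F = \overline{\Psi} \circ \Theta_E$. We conclude that $F = E_\nu$ and thus, (i) implies (ii). □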

Let the probability measure $\nu$ in (4) be the Dirac measure $\delta_x$ at some point $x \in G$. In this
case, the corresponding observable $E_{\delta_x}$ has the form

(5)  $E_{\delta_x}(X) = E(Xx^{-1}) =: E_x(X).$

This observable, which we denote by $E_x$, is therefore just a translated version of $E$. It is clear
that $E \sim_{\mathrm{post}} E_x$. If $G$ is an Abelian group, then $E_x(X) = E(x^{-1}X) = U(x^{-1}) E(X) U(x)$ and
therefore $E_x \sim_U E$.



Proposition 4. Let $E \in \mathcal{O}$ and $\nu \in M_1^+(G)$. The following conditions are equivalent:

(i) $E_\nu$ is approximately sharp.

(ii) $E$ is approximately sharp and $\nu = \delta_x$ for some $x \in G$ (i.e. $E_\nu = E_x$).

Proof. Let us first note that since $G$ is Hausdorff and second countable, it is metrizable and we
can choose a left invariant metric $d$ for $G$, i.e., $d(gx, gy) = d(x, y)$ for every $g, x, y \in G$ (see, for
instance, Theorem 8.3 in [10] for this fact).

Suppose that (i) holds. We make a counter assumption that $\nu$ is not a Dirac measure,
i.e., there are two different points $x, y$ in the support of $\nu$. Denote $r := \frac{1}{4} d(x, y)$. Then
the open balls $B(x; r)$ and $B(y; r)$ are disjoint and have positive $\nu$-measure. Thus, if we set
$a = \min\{\nu(B(x; r)), \nu(B(y; r))\}$, then $a > 0$. As the metric $d$ is left invariant, we have
$gB(x; r) = B(gx; r)$ for any $g \in G$. Moreover, for any $g \in G$, we have $B(gx; r) \cap B(x; r) = \emptyset$
or $B(gx; r) \cap B(y; r) = \emptyset$. This implies that $\nu(gB(x; r)) \le 1 - a$ for all $g \in G$. It then follows
that

$$\|E_\nu(B(x; r))\| = \left\| \int_G \nu(g^{-1}B(x; r))\, dE(g) \right\| \le \sup_{g \in G} \nu(g^{-1}B(x; r)) \cdot \|E(G)\| \le 1 - a < 1.$$

This is in contradiction with our assumption that (i) holds. Hence, $\nu$ is the Dirac measure $\delta_x$
for some point $x \in G$. For every $x \in G$ and $X \in \mathcal{B}(G)$, we have

(6)  $\|E_{\delta_x}(Xx)\| = \|E(X)\|.$

Therefore, $\|E(X)\| = 1$ for every nonempty open set $X \subseteq G$ since the right multiplication is a
homeomorphism. This shows that (i) $\Rightarrow$ (ii).

The fact that (ii) $\Rightarrow$ (i) is clear from (6). □
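
The mechanism behind this proof can be checked in a finite toy model. The sketch below again assumes the cyclic group $\mathbb{Z}_n$ instead of a general compact group and takes point masses as the best available input distributions; `best_point_probability` is a hypothetical helper name. It shows that after smearing with a non-Dirac $\nu$ no single outcome can be predicted with probability above the largest weight of $\nu$.

```python
def convolve(nu, p):
    """(nu * p)({x}) = sum_g nu({x - g}) p({g}) on the cyclic group Z_n."""
    n = len(p)
    return [sum(nu[(x - g) % n] * p[g] for g in range(n)) for x in range(n)]

def best_point_probability(nu):
    """Largest single-outcome probability of nu * p over all point masses p."""
    n = len(nu)
    best = 0.0
    for g in range(n):
        p = [0.0] * n
        p[g] = 1.0                       # the sharpest possible input distribution
        best = max(best, max(convolve(nu, p)))
    return best

two_point = [0.7, 0.3, 0.0, 0.0]         # non-Dirac: weight a = 0.3 sits elsewhere
dirac = [0.0, 0.0, 1.0, 0.0]             # Dirac measure at the element 2

bound_two_point = best_point_probability(two_point)
bound_dirac = best_point_probability(dirac)
```

In this toy model the bound $1 - a$ of the proof becomes $0.7$ for the two-point measure, while the Dirac measure leaves the value $1$ attainable.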

As a consequence of Propositions 3 and 4 we get the following result.

Proposition 5. Let $E \in \mathcal{O}$ be an approximately sharp observable. Then $E$ is postprocessing
clean in $\mathcal{O}$. Another observable $F \in \mathcal{O}$ is postprocessing equivalent with $E$ if and only if $F = E_x$
for some $x \in G$.

Proof. Let $E \in \mathcal{O}$ be an approximately sharp observable. If $F \in \mathcal{O}$ is such that $E \preceq_{\mathrm{post}} F$, then
by Prop. 3 we have $E = F_\nu$ for some $\nu \in M_1^+(G)$. But Prop. 4 now implies that $\nu = \delta_x$ for some
$x \in G$, which means, in particular, that $E \sim_{\mathrm{post}} F$. Therefore, $E$ is postprocessing clean. This
reasoning also proves the second claim. □

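2.5. Preprocessing. Let $\Phi : \mathcal{T}(\mathcal{H}) \to \mathcal{T}(\mathcal{H})$ be a linear mapping such that its adjoint
$\Phi^* : \mathcal{L}(\mathcal{H}) \to \mathcal{L}(\mathcal{H})$ is completely positive and $\Phi^*(I) = I$. We say that $\Phi$ is a channel and denote
the set of all channels by $\mathrm{Ch}(\mathcal{H})$. Let $\mathrm{Ch}(\mathcal{H})^*$ be the set of the adjoints of channels, that is,
$\mathrm{Ch}(\mathcal{H})^*$ consists of normal completely positive maps $\Xi : \mathcal{L}(\mathcal{H}) \to \mathcal{L}(\mathcal{H})$ such that $\Xi(I) = I$.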

Definition 6. Let $E, F \in \mathcal{O}$. We say that $F$ is a preprocessing of $E$, and denote $F \preceq_{\mathrm{pre}} E$, if
there exists a channel $\Phi : \mathcal{T}(\mathcal{H}) \to \mathcal{T}(\mathcal{H})$ such that

(7)  $\Theta_F = \Theta_E \circ \Phi.$

Written in terms of the adjoint channel $\Phi^*$, Eq. (7) amounts to saying that $F(X) = \Phi^*(E(X))$
for all $X \in \mathcal{B}(G)$.

The relation $\preceq_{\mathrm{pre}}$ on $\mathcal{O}$ is clearly a preorder and we denote by $\sim_{\mathrm{pre}}$ the induced equivalence
relation.

Definition 7. An observable $E \in \mathcal{O}$ is preprocessing clean in $\mathcal{O}$ if for every $F \in \mathcal{O}$, the following
implication holds:

(8)  $E \preceq_{\mathrm{pre}} F \;\Rightarrow\; E \sim_{\mathrm{pre}} F.$

The preprocessing relation has some simple connections to the other relations.

Proposition 6. If $E, F \in \mathcal{O}$ and $E \sim F$, then $E \sim_{\mathrm{pre}} F$.

Proof. The relation $E \sim F$ means that $F(\cdot) = W E(\cdot) W^*$ for some unitary operator $W$. Define
unitary channels $\Phi_1(T) = W^* T W$ and $\Phi_2(T) = W T W^*$. Then $\Theta_F = \Theta_E \circ \Phi_1$ and $\Theta_E = \Theta_F \circ \Phi_2$.
Hence, $E \sim_{\mathrm{pre}} F$. □

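Proposition 7. Let $E, F \in \mathcal{O}$ and $F \preceq_{\mathrm{pre}} E$. If $F$ is approximately sharp, then $E$ is also
approximately sharp.

Proof. Let $\Phi$ be a channel such that (7) holds. For any $X \in \mathcal{B}(G)$, we get

$$\|F(X)\| = \sup_{\varrho \in \mathcal{S}(\mathcal{H})} \mathrm{tr}[\varrho F(X)] = \sup_{\varrho \in \mathcal{S}(\mathcal{H})} \mathrm{tr}[\Phi(\varrho) E(X)] \le \sup_{\varrho \in \mathcal{S}(\mathcal{H})} \mathrm{tr}[\varrho E(X)] = \|E(X)\|.$$

Thus, $\|F(X)\| = 1$ implies that $\|E(X)\| = 1$. □

A channel $\Phi$ is called $U$-covariant if

(9)  $\Phi(U(g) \varrho\, U(g)^*) = U(g) \Phi(\varrho) U(g)^*$

for every $\varrho \in \mathcal{S}(\mathcal{H})$ and $g \in G$.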

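Proposition 8. Let $E, F \in \mathcal{O}$ and $F \preceq_{\mathrm{pre}} E$. Then there is a $U$-covariant channel $\widetilde{\Phi}$ such that
(7) holds.

Proof. Let $\Phi$ be a channel such that (7) holds. Since $E$ and $F$ are covariant, we have

(10)  $\Theta_F(U(g)^* \varrho\, U(g)) = \Theta_E(U(g)^* \Phi(\varrho) U(g))$

for every $\varrho \in \mathcal{S}(\mathcal{H})$ and $g \in G$. We define a linear mapping $\widetilde{\Phi} : \mathcal{T}(\mathcal{H}) \to \mathcal{T}(\mathcal{H})$ by setting

(11)  $\widetilde{\Phi}(\varrho) = \int_G U(g)\, \Phi(U(g)^* \varrho\, U(g))\, U(g)^*\, dg.$

Let $\Phi^*$ and $\widetilde{\Phi}^*$ be the adjoints of $\Phi$ and $\widetilde{\Phi}$. Hence,

(12)  $\widetilde{\Phi}^*(A) = \int_G U(g)\, \Phi^*(U(g)^* A\, U(g))\, U(g)^*\, dg, \quad A \in \mathcal{L}(\mathcal{H}).$

The complete positivity of $\widetilde{\Phi}^*$ is equivalent to

(13)  $\sum_{i,j=1}^{n} \langle \psi_i | \widetilde{\Phi}^*(A_i^* A_j) \psi_j \rangle \ge 0$

for all $n = 1, 2, \ldots$, $A_1, \ldots, A_n \in \mathcal{L}(\mathcal{H})$, and $\psi_1, \ldots, \psi_n \in \mathcal{H}$. Since

$$\sum_{i,j=1}^{n} \langle \psi_i | \widetilde{\Phi}^*(A_i^* A_j) \psi_j \rangle = \int_G \sum_{i,j=1}^{n} \left\langle U(g)^* \psi_i \,\middle|\, \Phi^*\!\big( (U(g)^* A_i U(g))^* \, U(g)^* A_j U(g) \big)\, U(g)^* \psi_j \right\rangle dg \ge 0$$

by the complete positivity of $\Phi^*$, it follows that $\widetilde{\Phi}^*$ is completely positive. But $\widetilde{\Phi}^*(I) = I$ so that
$\widetilde{\Phi}$ is a channel. Using the invariance of the Haar integral, it is straightforward to verify that
$\widetilde{\Phi}$ is $U$-covariant. By Eq. (10), $\Theta_F = \Theta_E \circ \widetilde{\Phi}$. □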
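
The averaging construction of this proof can be tried out on a finite group, where the Haar integral is a normalized sum. The sketch below is an illustration under stated assumptions, not the paper's setting: it takes the cyclic group $\mathbb{Z}_4$ acting on a qubit by diagonal unitaries and a Hadamard conjugation as the non-covariant channel. All names (`phi`, `phi_twirled`, `conj_by`) are illustrative.

```python
import cmath

N = 4  # order of the cyclic group Z_N (a finite stand-in for a compact group)

def mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dag(a):
    """Conjugate transpose of a 2x2 matrix."""
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]

def conj_by(u, rho):
    """Return u rho u*."""
    return mul(mul(u, rho), dag(u))

def U(g):
    """Unitary representation of Z_N on C^2: U(g) = diag(1, exp(2 pi i g / N))."""
    return [[1 + 0j, 0j], [0j, cmath.exp(2j * cmath.pi * g / N)]]

S = 2 ** -0.5
HAD = [[S + 0j, S + 0j], [S + 0j, -S + 0j]]  # Hadamard unitary

def phi(rho):
    """A unitary channel that is not U-covariant."""
    return conj_by(HAD, rho)

def phi_twirled(rho):
    """Group average (1/N) sum_g U(g) phi(U(g)* rho U(g)) U(g)*."""
    out = [[0j, 0j], [0j, 0j]]
    for g in range(N):
        term = conj_by(U(g), phi(conj_by(dag(U(g)), rho)))
        for i in range(2):
            for j in range(2):
                out[i][j] += term[i][j] / N
    return out

def max_diff(a, b):
    """Largest entrywise distance between two 2x2 matrices."""
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))

rho = [[0.7 + 0j, 0.2 + 0.1j], [0.2 - 0.1j, 0.3 + 0j]]
h = 1
defect_phi = max_diff(phi(conj_by(U(h), rho)), conj_by(U(h), phi(rho)))
defect_twirled = max_diff(phi_twirled(conj_by(U(h), rho)),
                          conj_by(U(h), phi_twirled(rho)))
trace_twirled = (phi_twirled(rho)[0][0] + phi_twirled(rho)[1][1]).real
```

The covariance defect of the averaged map vanishes up to rounding, while the original channel fails the covariance condition, and the trace is preserved by the average.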

Proposition 8 shows that the investigation of the preprocessing relation reduces to the study
of covariant channels. We will need the following general result in order to characterise the set
of $U$-covariant elements in $\mathrm{Ch}(\mathcal{H})^*$. For a proof, see e.g. [1].

Proposition 9. Suppose $U$ is a unitary representation of $G$ in a Hilbert space $\mathcal{H}$, and $\Xi \in
\mathrm{Ch}(\mathcal{H})^*$ is a $U$-covariant channel. There exists a separable Hilbert space $\mathcal{K}$, a unitary
representation $D$ of $G$ in $\mathcal{K}$, and an isometry $W : \mathcal{H} \to \mathcal{K} \otimes \mathcal{H}$ such that

$$W U(g) = (D(g) \otimes U(g))\, W \quad \forall g \in G,$$

(14)  $\Xi(A) = W^* (I \otimes A) W \quad \forall A \in \mathcal{L}(\mathcal{H}).$

We apply Proposition 9 in Section 3.7 in the study of phase observables.

3. Optimal phase observables 

3.1. Structure of phase observables. Let $\mathcal{H}$ be a complex Hilbert space with an orthonormal
basis $\{|n\rangle \mid n \in \mathbb{N}\}$, $\mathbb{N} = \{0, 1, 2, \ldots\}$. We define the lowering, raising, and number operators as

$$a := \sum_{n=0}^{\infty} \sqrt{n+1}\, |n\rangle\langle n+1|, \quad a^* := \sum_{n=0}^{\infty} \sqrt{n+1}\, |n+1\rangle\langle n|, \quad N := a^* a = \sum_{n=0}^{\infty} n\, |n\rangle\langle n|,$$

respectively. Physically the Hilbert space $\mathcal{H}$ and the above operators are associated with a
single-mode optical field. The vectors $|n\rangle$ are called number states.

Coherent states $|z\rangle := e^{-|z|^2/2} \sum_{n=0}^{\infty} \frac{z^n}{\sqrt{n!}} |n\rangle$, where $z \in \mathbb{C}$, describe laser light which is
quasi-monochromatic and thus can be approximated as a single-mode system; here $|z|$ is the energy
and $\arg z$ is the phase parameter of $|z\rangle$.
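
The coherent state expansion is easy to check numerically. The following sketch truncates the number basis at a cutoff `nmax` (an assumption of the illustration) and verifies that the amplitudes are normalized and that the mean of the number operator $N$ equals $|z|^2$; the function name is illustrative.

```python
import math

def coherent_amplitudes(z, nmax):
    """Amplitudes <n|z> = exp(-|z|^2 / 2) z^n / sqrt(n!) for n = 0, ..., nmax."""
    pref = math.exp(-abs(z) ** 2 / 2)
    return [pref * z ** n / math.sqrt(math.factorial(n)) for n in range(nmax + 1)]

z = 1.5 * complex(math.cos(0.7), math.sin(0.7))   # |z| = 1.5, arg z = 0.7
amps = coherent_amplitudes(z, 60)

norm = sum(abs(a) ** 2 for a in amps)                      # close to 1
mean_n = sum(n * abs(a) ** 2 for n, a in enumerate(amps))  # close to |z|^2
```

The number statistics $|\langle n|z\rangle|^2$ form a Poisson distribution with mean $|z|^2$, so the truncation error is negligible once `nmax` is well above $|z|^2$.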

An observable $E$ describing a phase parameter measurement should have the interval $[0, 2\pi)$
as its outcome space. For convenience, we use the one dimensional torus $\mathbb{T} = \{t \in \mathbb{C} : |t| = 1\}$ as
an equivalent description. Let $|z\rangle$ be a coherent state and $t \in \mathbb{T}$. Then $|tz\rangle$ is another coherent
state, now having phase parameter $\arg z + \arg t$ (addition modulo $2\pi$). Therefore, we require
that $E$ describing a phase parameter measurement satisfies the condition

(15)  $p^E_{|tz\rangle}(X) = p^E_{|z\rangle}(t^{-1}X)$

for all $z \in \mathbb{C}$, $t \in \mathbb{T}$ and $X \in \mathcal{B}(\mathbb{T})$.

As shown in [20], an observable $E : \mathcal{B}(\mathbb{T}) \to \mathcal{L}(\mathcal{H})$ satisfies condition (15) if and only if

(16)  $U(t) E(X) U(t)^* = E(tX)$

for all $t \in \mathbb{T}$ and $X \in \mathcal{B}(\mathbb{T})$, where $U$ is the number representation of $\mathbb{T}$ in $\mathcal{H}$, i.e.,

$$U(t)|n\rangle = t^n |n\rangle \quad \forall n \in \mathbb{N}.$$

Clearly, $U(t)|z\rangle = |tz\rangle$. We take this covariance condition as the definition of phase observables.

Definition 8. An observable $E : \mathcal{B}(\mathbb{T}) \to \mathcal{L}(\mathcal{H})$ is a phase observable if it satisfies the covariance
condition (16).

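The following phase theorem characterizing phase observables has been proved by various
different methods in [5], [12], [18].

Theorem 1 (Phase Theorem). An observable $E : \mathcal{B}(\mathbb{T}) \to \mathcal{L}(\mathcal{H})$ is a phase observable if and
only if

$$\langle m|E(X)|n\rangle = c_{m,n} \int_X t^{m-n}\, dt = c_{m,n} \int_{\arg X} e^{i(m-n)\theta}\, \frac{d\theta}{2\pi} \quad \forall m, n \in \mathbb{N},$$

where the phase matrix $(c_{m,n})$ is a positive semidefinite complex $\mathbb{N} \times \mathbb{N}$-matrix and $c_{n,n} = 1$ for
all $n \in \mathbb{N}$.

If $E_1$, $E_2$ are phase observables with phase matrices $(c^1_{m,n})$ and $(c^2_{m,n})$, then $E_1 \sim_U E_2$ if and
only if there exists a sequence $\{\lambda_n\}_{n \in \mathbb{N}} \subseteq \mathbb{T}$ such that $c^2_{m,n} = \overline{\lambda_n} \lambda_m c^1_{m,n}$.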

Using Theorem 1 we can easily see some general properties of phase observables. For instance,
the probability measure of a phase observable $E$ in a number state $|n\rangle$ is uniformly
distributed, which is expected as number states do not have a specific phase. Also, a phase
observable is never sharp (a projection valued measure); see e.g. [18].
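
The matrix element formula of the Phase Theorem can be evaluated in closed form when $X$ is an arc. The sketch below assumes the normalization $d\theta/2\pi$ on $[0, 2\pi)$, a finitely supported state vector, and a phase matrix passed as a function `c(m, n)`; these choices and all names are illustrative.

```python
import cmath
import math

def arc_integral(k, a, b):
    """(1 / 2 pi) times the integral of exp(i k theta) over the arc [a, b]."""
    if k == 0:
        return (b - a) / (2 * math.pi)
    return (cmath.exp(1j * k * b) - cmath.exp(1j * k * a)) / (2j * math.pi * k)

def probability(psi, c, a, b):
    """<psi|E(X)|psi> for the arc X = [a, b], using <m|E(X)|n> = c(m, n) * arc_integral(m - n)."""
    total = 0j
    for m, pm in enumerate(psi):
        for n, pn in enumerate(psi):
            total += complex(pm).conjugate() * pn * c(m, n) * arc_integral(m - n, a, b)
    return total.real

def canonical(m, n):
    """Phase matrix with all entries equal to 1."""
    return 1.0

number_state = [0, 0, 0, 1]              # the number state |3>
p_number = probability(number_state, canonical, 0.3, 1.3)

superposition = [2 ** -0.5, 2 ** -0.5]   # (|0> + |1>) / sqrt(2)
p_super = probability(superposition, canonical, -math.pi / 2, math.pi / 2)
```

For a number state only the diagonal term survives, so the probability of an arc is its length divided by $2\pi$ whatever the phase matrix is. For the superposition the off-diagonal entries produce the density $(1 + \cos\theta)/2\pi$.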

We notice that $U$-equivalence of two phase observables is, in general, a stronger requirement
than equivalence. This is demonstrated in the following proposition.

Proposition 10. There exist phase observables $E_1$ and $E_2$ such that $E_1$ is equivalent to $E_2$, but
$E_1$ is not $U$-equivalent to $E_2$.

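Proof. Fix $z \in \mathbb{C}$, $0 < |z| < 1$, and define phase observables $E_1$ and $E_2$ by setting

$$E_1(X) = \int_X dt\; I + z \int_X t^{-1}\, dt\; |0\rangle\langle 1| + \bar{z} \int_X t\, dt\; |1\rangle\langle 0|,$$

$$E_2(X) = \int_X dt\; I + z \int_X t^{-1}\, dt\; |2\rangle\langle 3| + \bar{z} \int_X t\, dt\; |3\rangle\langle 2|$$

for all $X \in \mathcal{B}(\mathbb{T})$. Define a unitary operator $W$ as

$$W = |2\rangle\langle 0| + |0\rangle\langle 2| + |1\rangle\langle 3| + |3\rangle\langle 1| + \sum_{n=4}^{\infty} |n\rangle\langle n|.$$

Then $E_2 = W E_1 W^*$. From Theorem 1 one sees easily that $E_1$ and $E_2$ cannot be $U$-equivalent. □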
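
The last step of the proof can be made concrete at the level of phase matrices. By Theorem 1, $U$-equivalence multiplies each entry of the phase matrix by a number of modulus one, so it preserves the moduli $|c_{m,n}|$. The sketch below truncates the matrices to a finite block (an assumption of the illustration) and checks both that the permutation $W$ maps one phase matrix to the other and that the moduli differ.

```python
DIM = 6                 # size of the truncated block of the phase matrices
z = 0.4 + 0.3j          # any z with 0 < |z| < 1

def phase_matrix(i, j):
    """Identity matrix with the extra entries c[i][j] = z and c[j][i] = conj(z)."""
    c = [[1 if m == n else 0 for n in range(DIM)] for m in range(DIM)]
    c[i][j] = z
    c[j][i] = z.conjugate()
    return c

c1 = phase_matrix(0, 1)   # phase matrix of E_1
c2 = phase_matrix(2, 3)   # phase matrix of E_2

sigma = [2, 3, 0, 1, 4, 5]   # W|n> = |sigma(n)>, swapping 0 <-> 2 and 1 <-> 3
permuted = [[c1[sigma[m]][sigma[n]] for n in range(DIM)] for m in range(DIM)]

same_moduli = all(abs(c1[m][n]) == abs(c2[m][n])
                  for m in range(DIM) for n in range(DIM))
```

Here `permuted` coincides with `c2`, reflecting $E_2 = W E_1 W^*$, whereas `same_moduli` is false because $|c^1_{0,1}| = |z|$ and $c^2_{0,1} = 0$.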
3.2. Canonical phase observable. The canonical phase observable $E_{\mathrm{can}}$ is the phase
observable determined by the phase matrix $c_{m,n} = 1$ for all $m, n \in \mathbb{N}$. If $E$ is another phase observable
with a phase matrix $(c_{m,n})$, then $E \sim_U E_{\mathrm{can}}$ if and only if $|c_{m,n}| = 1$ for all $m, n \in \mathbb{N}$; see [19].

The canonical phase observable $E_{\mathrm{can}}$ has some properties which make it special among all
phase observables. For example, it is the only phase observable (up to equivalence) which
generates the number shifts [19]. A drawback of $E_{\mathrm{can}}$ is that a realistic measurement scheme
for it is not known.

We will see that $E_{\mathrm{can}}$ is an optimal phase observable in all the four different ways we listed in
Section 1. However, (perhaps surprisingly) it is not the unique phase observable having this
feature.

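3.3. State generated phase observables. Let $D(z) := e^{z a^* - \bar{z} a}$, $z \in \mathbb{C}$, be the shift operator,
and denote by $\mathcal{S}(\mathcal{H})_{\mathrm{diag}}$ the convex set of all diagonal states, that is, the states of the form
$T = \sum_{n=0}^{\infty} \lambda_n |n\rangle\langle n|$ where $\lambda_n \ge 0$ for all $n \in \mathbb{N}$ and $\sum_{n=0}^{\infty} \lambda_n = 1$. Obviously, the extremal
elements of $\mathcal{S}(\mathcal{H})_{\mathrm{diag}}$ are the one-dimensional projections of the form $|n\rangle\langle n|$, $n \in \mathbb{N}$.

The phase observable generated by $T \in \mathcal{S}(\mathcal{H})_{\mathrm{diag}}$ (or a phase space phase observable), denoted
by $E_T$, is defined by

(17)  $E_T(X) = \frac{1}{\pi} \int_{\arg X} \int_0^{\infty} D(re^{i\theta})\, T\, D(re^{i\theta})^*\, r\, dr\, d\theta$  (weakly)

for all $X \in \mathcal{B}(\mathbb{T})$ [18]. Using the decomposition of $T$, one gets

$$E_T(X) = \sum_{n=0}^{\infty} \lambda_n E_{|n\rangle}(X) \quad \text{(weakly) for all } X \in \mathcal{B}(\mathbb{T}),$$

where we have denoted $E_{|n\rangle\langle n|}$ briefly by $E_{|n\rangle}$. Note that the operator $T$ in formula (17) must
be diagonal in the number basis so that the corresponding observable is phase shift
covariant [18].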

State generated phase observables are important since some of them have been measured.
Indeed, $E_{|0\rangle}$ can be seen as an angle marginal observable of a phase space observable associated
with a Q-function, and this has been measured by Walker and Carroll for a coherent state input
field [2^]. Moreover, in principle all state generated phase observables can be measured using
an eight-port homodyne detector [16].

We will see that state generated phase observables are approximately sharp and postprocessing
clean. They are neither extremal nor preprocessing clean.

3.4. Approximately sharp phase observables. Let $C(\mathbb{T})$ be the Banach space of continuous
complex functions on $\mathbb{T}$ equipped with the sup norm. By the Riesz representation theorem,
the topological dual $M(\mathbb{T})$ of $C(\mathbb{T})$ consists of regular complex Borel measures on $\mathbb{T}$ (or Baire
measures). We equip $M(\mathbb{T})$ with the weak-star topology. In this topology, a net $\{p_i\}_{i \in I} \subseteq
M(\mathbb{T})$ converges to a point $p \in M(\mathbb{T})$ if $\lim_{i \in I} \int f\, dp_i = \int f\, dp$ for all $f \in C(\mathbb{T})$; we denote
$p = \text{w*-}\lim_{i \in I} p_i$.

Let $\delta_t : C(\mathbb{T}) \to \mathbb{C}$, $f \mapsto \delta_t(f) = f(t)$, be the Dirac distribution (or Dirac measure)
concentrated on $t \in \mathbb{T}$. Let us write the parameter $z$ of a coherent state $|z\rangle$ in the form $z = rt$, where
$r = |z| \in \mathbb{R}$ and $t = z/|z| \in \mathbb{T}$. Let us then recall the following result, proved in [19].

Proposition 11. Let $E$ be a phase observable with a phase matrix $(c_{m,n})$, and let $u \in \mathbb{T}$. Then

$$\text{w*-}\lim_{r \to \infty} p^E_{|rt\rangle} = \delta_{tu}$$

if and only if

$$\lim_{m \to \infty} c_{m,m+k} = u^k$$

for all $k = 1, 2, 3, \ldots$.

This leads to the following useful conclusion.

Proposition 12. Let $E$ be a phase observable with a phase matrix $(c_{m,n})$. If there exists $u \in \mathbb{T}$
such that

(18)  $\lim_{m \to \infty} c_{m,m+k} = u^k$

for all $k = 1, 2, 3, \ldots$, then $E$ is approximately sharp.

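Proof. If condition (18) holds, then $\text{w*-}\lim_{r \to \infty} p^E_{|rt\rangle} = \delta_{tu}$ by the previous proposition. Let
$X \in \mathcal{B}(\mathbb{T})$ be a nonempty open set and choose $t$ such that $tu \in X$. Let $f \in C(\mathbb{T})$ be such that
$f(tu) = 1$, the support of $f$ is contained in $X$ and $0 \le f \le 1$. For any $\epsilon > 0$ there exists $r_\epsilon > 0$
such that $\int f\, dp^E_{|rt\rangle} > 1 - \epsilon$ for all $r > r_\epsilon$. Since, for all $r > r_\epsilon$,

$$1 = p^E_{|rt\rangle}(X) + p^E_{|rt\rangle}(\mathbb{T} \setminus X) \ge \int f\, dp^E_{|rt\rangle} + p^E_{|rt\rangle}(\mathbb{T} \setminus X) > 1 - \epsilon + p^E_{|rt\rangle}(\mathbb{T} \setminus X),$$

it follows that $p^E_{|rt\rangle}(\mathbb{T} \setminus X) < \epsilon$ for all $r > r_\epsilon$. Thus, $\lim_{r \to \infty} p^E_{|rt\rangle}(\mathbb{T} \setminus X) = 0$ and, hence,
$\lim_{r \to \infty} p^E_{|rt\rangle}(X) = 1$. This implies that $\|E(X)\| = 1$. □

Note that (in the context of the above proof) $\lim_{r \to \infty} p^E_{|rt\rangle}(X) = \delta_{tu}(X)$ does not hold for
all open $X$. Take, e.g., $X = \mathbb{T} \setminus \{tu\}$ and note that then $p^E_{|rt\rangle}(X) = 1$ since $p^E_{|rt\rangle}$ is an absolutely
continuous measure with respect to the Haar measure.

Since the canonical phase observable $E_{\mathrm{can}}$ corresponds to the phase matrix $c_{m,n} = 1$, it follows
from Proposition 12 that $E_{\mathrm{can}}$ is approximately sharp (see also [9] for a different proof).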
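
The concentration of the canonical phase distribution can be observed numerically. The sketch below assumes the phase matrix $c_{m,n} = 1$, a coherent state $|r\rangle$ with phase parameter $0$, a truncation of the number basis at `nmax`, and the arc $[-h, h]$ as the open set; the names `weights` and `arc_probability` are illustrative.

```python
import math

def weights(r, nmax):
    """Coherent state amplitudes exp(-r^2 / 2) r^n / sqrt(n!), computed via logarithms."""
    if r == 0:
        return [1.0] + [0.0] * nmax
    return [math.exp(-r * r / 2 + n * math.log(r) - math.lgamma(n + 1) / 2)
            for n in range(nmax + 1)]

def arc_probability(r, half_width, nmax=120):
    """Probability that the canonical phase of |r> lies in the arc [-half_width, half_width]."""
    w = weights(r, nmax)
    total = 0.0
    for m in range(nmax + 1):
        for n in range(nmax + 1):
            k = m - n
            # (1 / 2 pi) * integral of exp(i k theta) over [-h, h]
            kernel = (half_width / math.pi if k == 0
                      else math.sin(k * half_width) / (math.pi * k))
            total += w[m] * w[n] * kernel
    return total

p_vacuum = arc_probability(0.0, 0.5)   # uniform distribution: arc length / 2 pi
p_small = arc_probability(1.0, 0.5)
p_large = arc_probability(4.0, 0.5)
```

The probability of the fixed arc grows with $r$ and approaches $1$, in line with approximate sharpness, while for every finite $r$ it stays strictly below $1$.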

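Proposition 13. All state generated phase observables $E_T$, $T \in \mathcal{S}(\mathcal{H})_{\mathrm{diag}}$, are approximately
sharp.

Proof. Let $E_T$ be the phase observable generated by $T = \sum_{s \in \mathbb{N}} \lambda_s |s\rangle\langle s| \in \mathcal{S}(\mathcal{H})_{\mathrm{diag}}$. Since
$E_T = \sum_{s=0}^{\infty} \lambda_s E_{|s\rangle}$ (weakly), the phase matrix elements of $E_T$ are $c^T_{m,n} = \sum_{s=0}^{\infty} \lambda_s c^{|s\rangle}_{m,n}$. It has
been shown in [19] that $\lim_{m \to \infty} c^{|s\rangle}_{m,m+k} = 1$ for all $k \in \mathbb{N}$ and $s \in \mathbb{N}$. Since $|c^{|s\rangle}_{m,m+k}| \le 1$, we
have $\lim_{m \to \infty} c^T_{m,m+k} = 1$ for all $k \in \mathbb{N}$. The claim then follows from Prop. 12 with $u = 1$. □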






Finally, let us notice that, as seen in the proof of Proposition 12,

$$\lim_{r \to \infty} \langle rt|E_T(X)|rt\rangle = 1$$

where $X \subseteq \mathbb{T}$ is an open set and $t \in X$. This means that in the classical limit when the energy of
the state $|z\rangle$ is large, the phase parameter of $|z\rangle$ can be determined accurately by measuring $E_T$.
However, there is no phase observable $E$ for which $\langle rt|E(X)|rt\rangle = 1$ for finite energies $r = |z|$.
There exist (phase shift covariant) generalized operator measures satisfying this condition; these
measures describe measurements where only a restricted class of state preparations is available.


3.5. Extremal phase observables. Let $\mathcal{C}$ be the convex set of all phase matrices and let
$\mathrm{Ext}\,\mathcal{C}$ be the set of its extremal elements. The correspondence between the convex sets $\mathcal{C}$ and
$\mathcal{O}$ preserves the convex structures. In particular, a phase observable $E$ determined by a phase
matrix $(c_{n,m})$ is extremal in $\mathcal{O}$ if and only if $(c_{n,m})$ is extremal in $\mathcal{C}$.

Let $(c_{n,m})$ be a phase matrix. It is always possible to choose a Hilbert space $\mathcal{K}$ and a sequence
of unit vectors $\{\eta_n\}_{n \in \mathbb{N}}$ which is total in $\mathcal{K}$ such that $c_{n,m} = \langle \eta_n | \eta_m \rangle$ for all $n, m \in \mathbb{N}$; see [5].
The dimension of $\mathcal{K}$ is uniquely determined by $(c_{n,m})$ and we call $\dim \mathcal{K}$ the rank of $(c_{n,m})$. Let us
denote by $\mathcal{T}(\mathcal{K})$ the Banach space of trace-class operators on $\mathcal{K}$. As proved in [3], [17], the phase
matrix $(c_{n,m})$ is extremal if and only if the trace-class closure of $\mathrm{lin}_{\mathbb{C}}\{|\eta_n\rangle\langle\eta_n| : n \in \mathbb{N}\} \subseteq \mathcal{T}(\mathcal{K})$
is $\mathcal{T}(\mathcal{K})$.

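Example 1. Let $\xi \in \mathbb{C}$, $|\xi| \le 1$. Define unit vectors $\eta^\xi_{2n} := |0\rangle$ and $\eta^\xi_{2n+1} := \xi|0\rangle + \sqrt{1 - |\xi|^2}\,|1\rangle$
for all $n \in \mathbb{N}$ and a phase matrix $(c^\xi_{n,m})$ by $c^\xi_{n,m} := \langle \eta^\xi_n | \eta^\xi_m \rangle$. Thus, $(c^\xi_{n,m})$ is the phase matrix of
the so-called chess-board phase observable $E^\xi$; see [19]. Assume that $|\xi| \neq 1$. Then the Hilbert
space $\mathcal{K}$ is $\mathbb{C}|0\rangle + \mathbb{C}|1\rangle$ and the rank of $(c^\xi_{n,m})$ is 2. Moreover,

$$|0\rangle\langle 1| \notin \mathrm{lin}_{\mathbb{C}}\{|\eta^\xi_n\rangle\langle\eta^\xi_n| : n \in \mathbb{N}\} = \mathbb{C}|0\rangle\langle 0| + \mathbb{C}\left(\xi|0\rangle\langle 1| + \bar{\xi}|1\rangle\langle 0| + \sqrt{1 - |\xi|^2}\,|1\rangle\langle 1|\right)$$

so that $E^\xi$ is not extremal. If $|\xi| = 1$, then the rank of $(c^\xi_{n,m})$ is 1 and $E^\xi$ is automatically
extremal; see Proposition 5 in [3]. In addition, $|c^\xi_{n,m}| = 1$ for all $n, m \in \mathbb{N}$ so that $E^\xi \sim E_{\mathrm{can}}$
[19]. For more examples of extremal phase observables, we refer to [17].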

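Proposition 14. If the phase matrix $(c_{m,n})$ has rank greater than 1 and $c_{m,n} \in \mathbb{R}$ for all
$m, n \in \mathbb{N}$, then $(c_{m,n}) \notin \mathrm{Ext}\,\mathcal{C}$, that is, the corresponding phase observable $E$ is not extremal.

Proof. Let $\eta_m$, $\eta_n$ be two linearly independent vectors in $\mathcal{K}$, and define a (nonzero) bounded
operator $B = |\eta_m\rangle\langle\eta_n| - |\eta_n\rangle\langle\eta_m|$ on $\mathcal{K}$. For every $j \in \mathbb{N}$, we have

$$\mathrm{tr}\big[B\, |\eta_j\rangle\langle\eta_j|\big] = c_{n,j} c_{j,m} - c_{m,j} c_{j,n} = c_{n,j} c_{j,m} - \overline{c_{n,j} c_{j,m}} = c_{n,j} c_{j,m} - c_{n,j} c_{j,m} = 0.$$

By the Hahn-Banach theorem this implies that the set $\mathrm{lin}_{\mathbb{C}}\{|\eta_n\rangle\langle\eta_n| : n \in \mathbb{N}\}$ is not dense in
$\mathcal{T}(\mathcal{K})$. Therefore, using the above cited criterion the phase matrix $(c_{m,n})$ is not extremal. □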
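
The annihilation property used in this proof can be verified for a concrete real phase matrix. The sketch below assumes the chess-board vectors of Example 1 with a real parameter $\xi$ (here `xi = 0.6`, an arbitrary choice) and works in the two-dimensional real space they span; the helper names are illustrative.

```python
import math

xi = 0.6                                  # real parameter with |xi| < 1
eta_even = [1.0, 0.0]                     # eta_{2n} = |0>
eta_odd = [xi, math.sqrt(1 - xi ** 2)]    # eta_{2n+1} = xi|0> + sqrt(1 - xi^2)|1>

def eta(n):
    """The n-th unit vector of the chess-board sequence."""
    return eta_even if n % 2 == 0 else eta_odd

def b_entry(i, j):
    """Matrix entry of B = |eta_0><eta_1| - |eta_1><eta_0| (real vectors)."""
    return eta(0)[i] * eta(1)[j] - eta(1)[i] * eta(0)[j]

def trace_b_projection(j):
    """tr[B |eta_j><eta_j|] = <eta_j|B|eta_j>."""
    v = eta(j)
    return sum(v[a] * b_entry(a, b) * v[b] for a in range(2) for b in range(2))

values = [trace_b_projection(j) for j in range(10)]
```

Every entry of `values` vanishes although $B$ itself is nonzero, so $B$ defines a nonzero functional that annihilates all the projections $|\eta_j\rangle\langle\eta_j|$, which is what rules out extremality.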

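Fix $s \in \mathbb{N}$. Elements of the phase matrix $(c_{m,n})$ of the state generated phase observable $E_{|s\rangle}$
are of the form

$$c_{m,n} = \int_0^{\infty} f^s_m(x)\, f^s_n(x)\, e^{-x}\, dx,$$

where

$$f^s_n(x) := (-1)^{\max\{0, s-n\}} \sqrt{\frac{(\min\{n,s\})!}{(\max\{n,s\})!}}\; x^{|s-n|/2}\, L^{|s-n|}_{\min\{n,s\}}(x)$$

and

$$L^{\alpha}_k(x) := \sum_{l=0}^{k} (-1)^l \binom{k+\alpha}{k-l} \frac{x^l}{l!}$$

is the associated Laguerre polynomial; see [HI [19]. Hence, the $\eta_n$-vectors of $(c_{m,n})$ are the unit
vectors $f^s_n$, $n \in \mathbb{N}$, of $L^2(\mathbb{R}_+, e^{-x}dx)$, where $\mathbb{R}_+$ is the set of positive reals. All the functions
$f^s_n$, $n \in \mathbb{N}$, are linearly independent, and therefore the rank of $(c_{m,n})$ is infinite. It is clear that
$c_{m,n} \in \mathbb{R}$ for all $m, n \in \mathbb{N}$, and hence the phase observable $E_{|s\rangle}$ is not extremal by Proposition 14.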
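
These integrals can be evaluated exactly term by term, since $\int_0^\infty x^p e^{-x}\, dx = \Gamma(p + 1)$. The sketch below assumes the explicit sum for the associated Laguerre polynomial, $L^\alpha_k(x) = \sum_{l=0}^{k} (-1)^l \binom{k+\alpha}{k-l} x^l / l!$, with integer $\alpha \ge 0$, and the functions $f^s_n(x) = (-1)^{\max\{0,s-n\}} \sqrt{\min\{n,s\}!/\max\{n,s\}!}\; x^{|s-n|/2} L^{|s-n|}_{\min\{n,s\}}(x)$; the function names are illustrative.

```python
import math

def laguerre_coeffs(k, alpha):
    """Coefficients a_l of L_k^alpha(x) = sum_l a_l x^l for integer alpha >= 0."""
    return [(-1) ** l * math.comb(k + alpha, k - l) / math.factorial(l)
            for l in range(k + 1)]

def f_coeffs(n, s):
    """Write f_n^s(x) = x^(|s-n|/2) * sum_l b_l x^l and return (|s-n|, [b_l])."""
    lo, hi = min(n, s), max(n, s)
    pref = (-1) ** max(0, s - n) * math.sqrt(math.factorial(lo) / math.factorial(hi))
    return hi - lo, [pref * a for a in laguerre_coeffs(lo, hi - lo)]

def phase_matrix_element(m, n, s):
    """Integral of f_m^s(x) f_n^s(x) exp(-x) over (0, infinity), via Gamma functions."""
    am, bm = f_coeffs(m, s)
    an, bn = f_coeffs(n, s)
    return sum(b1 * b2 * math.gamma((am + an) / 2 + i + j + 1)
               for i, b1 in enumerate(bm) for j, b2 in enumerate(bn))

diag = phase_matrix_element(3, 3, 2)      # unit vectors: diagonal entries equal 1
c01_vac = phase_matrix_element(0, 1, 0)   # equals Gamma(3/2) = sqrt(pi) / 2 for s = 0
c02_one = phase_matrix_element(0, 2, 1)   # vanishes for s = 1
```

The diagonal entries equal $1$ because the $f^s_n$ are unit vectors, and all entries are real, which is the property that Proposition 14 exploits.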

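Let $E_T$ be the phase observable generated by a state $T = \sum_{s=0}^{\infty} \lambda_s |s\rangle\langle s| \in \mathcal{S}(\mathcal{H})_{\mathrm{diag}}$. Then
$E_T = \sum_{s=0}^{\infty} \lambda_s E_{|s\rangle}$ and hence, $E_T$ is not extremal. We thus have the following conclusion.

Proposition 15. No $E_T$, $T \in \mathcal{S}(\mathcal{H})_{\mathrm{diag}}$, is extremal in the convex set of all phase
observables.

For completeness, we note that the phase matrix elements of $E_T$ are $c^T_{m,n} = \sum_{s=0}^{\infty} \lambda_s c^{|s\rangle}_{m,n} =
\langle f^T_m | f^T_n \rangle$, where $f^T_n = \sum_{s=0}^{\infty} \sqrt{\lambda_s}\, f^s_n \otimes |s\rangle$ is a unit vector of the Hilbert space $L^2(\mathbb{R}_+, e^{-x}dx) \otimes \mathcal{H}$,
and it follows that $\mathrm{rank}(c^T_{m,n}) = \infty$.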

Remark 1. Since the physically relevant phase observables $E_T$ and $E_{\mathrm{can}}$ all have real-valued
phase matrices, it would be interesting to study the extremals of the smaller convex set of
real-valued phase matrices $\mathcal{C}_{\mathbb{R}} = \{(c_{m,n}) \in \mathcal{C} \mid c_{m,n} \in \mathbb{R},\ m, n \in \mathbb{N}\}$. The complete characterization
of such extremals is given in [17, Theorem 1]. The canonical phase observable is extremal in
$\mathcal{C}_{\mathbb{R}}$, but the question of the extremality of the state generated phase observables is open. The
method of the proof of Proposition 14 cannot be directly applied in this case as the operator
$|\eta_m\rangle\langle\eta_n| - |\eta_n\rangle\langle\eta_m|$ is trivial on the real Banach space $\mathcal{T}_s(\mathcal{K})$ of selfadjoint trace class operators
on $\mathcal{K} = \mathrm{lin}_{\mathbb{R}}\{\eta_n : n \in \mathbb{N}\}$.



3.6. Postprocessing. As explained in Section 3.4 the observables $E_{\mathrm{can}}$ and $E_T$ are approximately sharp. Therefore, it follows directly from Proposition 5 that they are all postprocessing clean in $\mathcal{O}$. It still remains to check whether they are postprocessing equivalent. Let us start with the following observation.

Proposition 16. Let $T, T' \in \mathcal{S}(\mathcal{H})_{\mathrm{diag}}$. If $E_T = E_{T'}$, then $T = T'$.

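Proof. Let $T = \sum_{s \in \mathbb{N}} \lambda_s |s\rangle\langle s| \in \mathcal{S}(\mathcal{H})_{\mathrm{diag}}$ and $T' = \sum_{s \in \mathbb{N}} \lambda'_s |s\rangle\langle s| \in \mathcal{S}(\mathcal{H})_{\mathrm{diag}}$. Thus, $c^T_{m,n} = \sum_{s=0}^{\infty} \lambda_s c^s_{m,n}$ and $c^{T'}_{m,n} = \sum_{s=0}^{\infty} \lambda'_s c^s_{m,n}$. Assume $E_T = E_{T'}$ so that $c^T_{m,n} = c^{T'}_{m,n}$ for all $m, n \in \mathbb{N}$.
For all $s, k \in \mathbb{N}$,

\[
\begin{aligned}
c^s_{0,2k} &= (-1)^{s+\max\{0,\,s-2k\}} \sqrt{\frac{(\min\{2k,s\})!}{s!\,(\max\{2k,s\})!}} \int_0^\infty x^{(s+|s-2k|)/2}\, L^{|s-2k|}_{\min\{2k,s\}}(x)\, e^{-x}\, dx \\
&= (-1)^{s+\max\{0,\,s-2k\}}\, \frac{\big((|s-2k|+s)/2\big)!\; \Gamma(k)}{\sqrt{(2k)!}\; s!\; \Gamma\big((|s-2k|-s)/2\big)},
\end{aligned}
\]

where $\Gamma$ is the Gamma function and we have used the following equation (see formula 7.414(11)):

\[ \int_0^\infty x^{\gamma-1} L^{\alpha}_n(x)\, e^{-x}\, dx = \frac{\Gamma(\gamma)\, \Gamma(1+\alpha+n-\gamma)}{n!\; \Gamma(1+\alpha-\gamma)}. \]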

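Since $\lim_{x \to -n} |\Gamma(x)| = \infty$ for all $n \in \mathbb{N}$, it follows that

\[ c^s_{0,2k} = 0 \]

if and only if $0 < k \le s$. Thus,

\[ c^T_{0,2k} = \sum_{s=0}^{k-1} \lambda_s c^s_{0,2k} = \sum_{s=0}^{k-1} \lambda'_s c^s_{0,2k}. \]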



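By induction $T = T'$. Indeed, $c^T_{0,2} = \lambda_0 c^0_{0,2} = \lambda'_0 c^0_{0,2}$ implies that $\lambda_0 = \lambda'_0$. If $\lambda_s = \lambda'_s$ for all $s = 0, 1, \ldots, k-1$ then

\[ (\lambda_k - \lambda'_k)\, c^k_{0,2(k+1)} = c^T_{0,2(k+1)} - c^{T'}_{0,2(k+1)} = 0. \]

Thus, $\lambda_k = \lambda'_k$. □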
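The vanishing pattern of $c^s_{0,2k}$ that drives the induction can be checked numerically. The sketch below rests on stated assumptions: it takes $f^s_n(x) = (-1)^{\max\{0,s-n\}} \sqrt{\min\{n,s\}!/\max\{n,s\}!}\; x^{|n-s|/2} L^{|n-s|}_{\min\{n,s\}}(x)$ with the standard associated Laguerre polynomials, and integrates term by term using $\int_0^\infty x^a e^{-x} dx = \Gamma(a+1)$. The function names are ours and purely illustrative.

```python
from math import comb, factorial, gamma, sqrt

def laguerre_coeffs(n, alpha):
    """Coefficients of L_n^alpha(x) = sum_l (-1)^l C(n+alpha, n-l) x^l / l!."""
    return [(-1) ** l * comb(n + alpha, n - l) / factorial(l)
            for l in range(n + 1)]

def f_parts(s, n):
    """Write f_n^s(x) = pref * x**power * poly(x); return (pref, power, poly)."""
    lo, hi = min(n, s), max(n, s)
    pref = (-1) ** max(0, s - n) * sqrt(factorial(lo) / factorial(hi))
    return pref, abs(n - s) / 2, laguerre_coeffs(lo, abs(n - s))

def c(s, m, n):
    """c^s_{m,n} = int_0^inf f_m^s(x) f_n^s(x) e^{-x} dx, term by term."""
    pm, em, am = f_parts(s, m)
    pn, en, an = f_parts(s, n)
    total = sum(a * b * gamma(em + en + i + j + 1)
                for i, a in enumerate(am) for j, b in enumerate(an))
    return pm * pn * total

def closed_form(s, k):
    """Closed form of c^s_{0,2k}, valid for 0 <= s < k (no Gamma pole)."""
    sign = (-1) ** (s + max(0, s - 2 * k))
    return (sign * factorial(k) * gamma(k)
            / (sqrt(factorial(2 * k)) * factorial(s) * gamma(k - s)))

# True where the numerically computed c^s_{0,2k} vanishes.
zero_pattern = {(s, k): abs(c(s, 0, 2 * k)) < 1e-9
                for s in range(5) for k in range(1, 5)}
```

For the small indices tried here, `zero_pattern[(s, k)]` is true exactly when $k \le s$, and `c(0, 0, 2)` agrees with $1/\sqrt{2}$. Floating point cancellation grows with the polynomial degree, so larger indices would call for exact rational arithmetic instead.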

Proposition 17. Phase observables $E_{\mathrm{can}}$ and $E_T$ are postprocessing clean. They are all postprocessing non-equivalent.

Proof. We need to prove the second claim. For two approximately sharp phase observables $E$ and $E'$ with phase matrices $(c_{m,n})$ and $(c'_{m,n})$, Prop. 5 implies (taking into account that $\mathbb{T}$ is an Abelian group) that $E \overset{\mathrm{POST}}{\sim} E'$ is equivalent to $E'(\cdot) = U(t)E(\cdot)U(t)^*$ for some $t \in \mathbb{T}$. By Theorem 1 this means that $c'_{m,n} = t^{m-n} c_{m,n}$ for all $m, n \in \mathbb{N}$.

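Let $T, T' \in \mathcal{S}(\mathcal{H})_{\mathrm{diag}}$ and assume that, for some $t \in \mathbb{T}$, $E_{T'}(\cdot) = U(t)E_T(\cdot)U(t)^*$. We have seen in the proof of Prop. 12 that $\lim_{m\to\infty} c^T_{m,m+k} = \lim_{m\to\infty} c^{T'}_{m,m+k} = 1$ for $k \in \mathbb{N}$. On the other hand, $c^{T'}_{m,m+k} = t^{-k} c^T_{m,m+k}$, from which $t = 1$ follows. Hence, $E_T \overset{\mathrm{POST}}{\sim} E_{T'}$ only if $E_T = E_{T'}$, i.e. only if $T = T'$ by Prop. 16.

Assume then that $E_{\mathrm{can}} \overset{\mathrm{POST}}{\sim} E_T$. It follows that $|c^T_{0,2}| = 1$. This is impossible since $c^T_{0,2} = \lambda_0 c^0_{0,2}$ (see the proof of Prop. 16) and

\[ c^0_{0,2} = \int_0^\infty f^0_0(x)\, f^0_2(x)\, e^{-x}\, dx = \frac{1}{\sqrt{2}} \int_0^\infty x\, e^{-x}\, dx = \frac{1}{\sqrt{2}} < 1. \]

□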




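3.7. Preprocessing. Let $E$ be a phase observable determined by the phase matrix $(c_{m,n}) \in \mathcal{C}$. As shown in [23], we can define a $U$-covariant channel by formula

\[ \mathcal{T}(\mathcal{H}) \ni T \mapsto \Phi_E(T) := \sum_{m,n=0}^{\infty} c_{n,m} \langle m | T | n \rangle\, |m\rangle\langle n| \in \mathcal{T}(\mathcal{H}). \]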

Moreover, this channel satisfies

\[ \Theta_{E} = \Theta_{E_{\mathrm{can}}} \circ \Phi_E. \]

This shows that every phase observable is a preprocessing of the canonical phase observable $E_{\mathrm{can}}$. In particular, $E_{\mathrm{can}}$ is preprocessing clean.
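To see concretely how the channel $\Phi_E$ turns the canonical phase observable into $E$, one can work in a finite truncation. The sketch below is an illustration under stated assumptions: the dimension $d = 4$, the vectors generating the phase matrix and the state $T$ are hypothetical test data, and the comparison is made on the Fourier moments $\operatorname{tr}[T E(f_j)] = \sum_n c_{n,n+j} \langle n+j|T|n\rangle$ for $j \ge 0$, assuming $E(f_j) = \sum_n c_{n,n+j} |n\rangle\langle n+j|$.

```python
import math

def gram(etas):
    """Phase matrix c[m][n] = <eta_m|eta_n> of the unit vectors eta_n."""
    return [[sum(a.conjugate() * b for a, b in zip(u, v)) for v in etas]
            for u in etas]

def channel(c, T):
    """Truncated Phi_E: <m|Phi_E(T)|n> = c[n][m] * <m|T|n>."""
    d = len(T)
    return [[c[n][m] * T[m][n] for n in range(d)] for m in range(d)]

def fourier_moment(c, T, j):
    """tr[T E(f_j)] with E(f_j) = sum_n c[n][n+j] |n><n+j|, for j >= 0."""
    d = len(T)
    return sum(c[n][n + j] * T[n + j][n] for n in range(d - j))

# Hypothetical data: four unit vectors in C^2 and a pure state T = |psi><psi|.
r = 1 / math.sqrt(2)
etas = [(1 + 0j, 0j), (0j, 1 + 0j), (r + 0j, r + 0j), (r + 0j, r * 1j)]
c = gram(etas)
c_can = [[1 + 0j] * 4 for _ in range(4)]  # canonical phase: all entries 1
psi = [0.5, 0.5j, -0.5, 0.5]
T = [[a * complex(b).conjugate() for b in psi] for a in psi]

# Moments of E in the state T versus moments of E_can in the state Phi_E(T).
lhs = [fourier_moment(c, T, j) for j in range(4)]
rhs = [fourier_moment(c_can, channel(c, T), j) for j in range(4)]
trace_after = sum(channel(c, T)[n][n] for n in range(4))
```

The lists `lhs` and `rhs` coincide entry by entry, and `trace_after` equals the trace of $T$ because the diagonal of a phase matrix is identically 1. In this truncation $\Phi_E$ is just the entrywise (Schur) product with the transposed phase matrix.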

It remains to find the equivalence class of $E_{\mathrm{can}}$ in the preprocessing relation. The preprocessing clean phase observables are exactly the phase observables belonging to this equivalence class.

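Proposition 18. If $\Xi \in \mathrm{Ch}(\mathcal{H})^*$ is such that

\[ \Xi(U(t)AU(t)^*) = U(t)\,\Xi(A)\,U(t)^* \]

for all $t \in \mathbb{T}$ and $A \in \mathcal{L}(\mathcal{H})$, then there exists a sequence of vectors $\{\phi_i\}_{i \in \mathbb{N}}$ in $\mathcal{H} \otimes \mathcal{H}$ with $\|\phi_i\| = 1$ such that

\[ \langle j | \Xi(A) | i \rangle = \int_{\mathbb{T}} t^{i-j} \langle \phi_j | (I \otimes U(t)AU(t)^*)\, \phi_i \rangle\, dt \qquad \forall\, i, j \in \mathbb{N}. \]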

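Proof. Let $\rho$ be the right regular representation of $\mathbb{T}$ in $L^2(\mathbb{T})$. Since $\mathbb{T}$ is compact and $\dim \mathcal{H} = \infty$, every separable unitary representation of $\mathbb{T}$ is contained in the tensor product representation $\rho \otimes I$ acting in the space $L^2(\mathbb{T}) \otimes \mathcal{H} = L^2(\mathbb{T}; \mathcal{H})$. Therefore, $\mathcal{K} = L^2(\mathbb{T}; \mathcal{H})$ and $D = \rho \otimes I$ is the most general choice in the context of Prop. 9.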

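We define the following unitary operator $T : L^2(\mathbb{T}) \otimes \mathcal{H} \otimes \mathcal{H} \to L^2(\mathbb{T}) \otimes \mathcal{H} \otimes \mathcal{H}$

\[ [T(f \otimes u \otimes v)](t) = f(t)\, u \otimes U(t)^* v. \]

Clearly,

\[ T(\rho \otimes I \otimes I) = (\rho \otimes I \otimes U)\,T = (D \otimes U)\,T. \]

Therefore, every isometry $W : \mathcal{H} \to \mathcal{K} \otimes \mathcal{H}$ intertwining $U$ with $D \otimes U$ is of the form $W = T\widetilde{W}$, where $\widetilde{W}$ is an isometry intertwining $U$ with $\rho \otimes I \otimes I$. The most general such $\widetilde{W}$ is given by

\[ \widetilde{W} |n\rangle = f_n \otimes \phi_n, \]

where $\{\phi_i\}_{i \in \mathbb{N}}$ is a sequence of unit vectors in $\mathcal{H} \otimes \mathcal{H}$. By Prop. every $U$-covariant $\Xi \in \mathrm{Ch}(\mathcal{H})^*$ is thus of the form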

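\[
\begin{aligned}
\langle j | \Xi(A) | i \rangle &= \langle j | \widetilde{W}^* T^* (I_{\mathcal{K}} \otimes A)\, T \widetilde{W} | i \rangle \\
&= \langle T(f_j \otimes \phi_j) \,|\, (I_{L^2(\mathbb{T})} \otimes I_{\mathcal{H}} \otimes A)\, T(f_i \otimes \phi_i) \rangle \\
&= \int_{\mathbb{T}} t^{i-j} \langle \phi_j | (I \otimes U(t)AU(t)^*)\, \phi_i \rangle\, dt
\end{aligned}
\]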
for all $A \in \mathcal{L}(\mathcal{H})$ and $i, j \in \mathbb{N}$. □

Proposition 19. If $E$ is a phase observable with phase matrix $(c_{m,n})$, then the following are equivalent:

(i) $E \overset{\mathrm{PRE}}{\sim} E_{\mathrm{can}}$

(ii) there exist $n_0 \in \mathbb{N}$ and a sequence $\{\lambda_n\}_{n \ge n_0}$ in $\mathbb{T}$ such that $c_{m,n} = \overline{\lambda_m}\lambda_n$ for all $m, n \ge n_0$.

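Proof. Let $L^\infty(\mathbb{T})$ be the Banach space of bounded measurable functions on the unit circle $\mathbb{T}$ with the sup norm. We recall that, if $E$ is a (not necessarily covariant) POVM based on $\mathbb{T}$ with values in $\mathcal{H}$, then we can define a norm decreasing linear map $E : L^\infty(\mathbb{T}) \to \mathcal{L}(\mathcal{H})$ in the following way. If $T \in \mathcal{T}(\mathcal{H})$, let $p^E_T$ be the bounded complex measure on $\mathbb{T}$ given by

\[ p^E_T(X) = \operatorname{tr}[T E(X)] \qquad \forall X \in \mathcal{B}(\mathbb{T}). \]

If $f \in L^\infty(\mathbb{T})$, the operator $E(f)$ is then defined by

\[ \operatorname{tr}[T E(f)] = \int_{\mathbb{T}} f(t)\, dp^E_T(t) \qquad \forall T \in \mathcal{T}(\mathcal{H}). \]

For all $j \in \mathbb{Z}$, let $f_j(t) = t^j$. Then

\[ E(f_j) = \sum_{n \ge 0} c_{n,j+n}\, |n\rangle\langle j+n| \quad \text{if } j \ge 0, \]

\[ E(f_j) = \sum_{n \ge 0} c_{n-j,n}\, |n-j\rangle\langle n| \quad \text{if } j < 0, \]

the sums converging in the weak sense. Clearly, $E(f_j)^* = E(f_{-j})$ for all $j \in \mathbb{N}$.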

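If $\Xi \in \mathrm{Ch}(\mathcal{H})^*$, then it is easy to check that $\Xi(E(f)) = E^{\Xi}(f)$, where $E^{\Xi}$ is the POVM given by

\[ E^{\Xi}(X) = \Xi(E(X)) \qquad \forall X \in \mathcal{B}(\mathbb{T}). \]

Suppose $\Xi$ is a $U$-covariant channel, and let $\{\phi_i\}_{i \in \mathbb{N}}$ be a sequence of unit vectors in $\mathcal{H} \otimes \mathcal{H}$ associated to $\Xi$ as in Prop. 18. Then by Fourier analysis $E_{\mathrm{can}} = E^{\Xi}$ if and only if $E_{\mathrm{can}}(f_j) = E^{\Xi}(f_j)$ for all $j \in \mathbb{N}$.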

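For $j \in \mathbb{N}$, we have

\[
\begin{aligned}
\langle q | E^{\Xi}(f_j) | p \rangle &= \int_{\mathbb{T}} t^{p-q} \langle \phi_q | (I \otimes U(t)E(f_j)U(t)^*)\, \phi_p \rangle\, dt \\
&= \int_{\mathbb{T}} t^{p-q} \sum_{n \ge 0} c_{n,j+n} \langle \phi_q | (I \otimes U(t)|n\rangle\langle j+n|U(t)^*)\, \phi_p \rangle\, dt \\
&= \sum_{n \ge 0} c_{n,j+n} \int_{\mathbb{T}} t^{p-q-j} \langle \phi_q | (I \otimes |n\rangle\langle j+n|)\, \phi_p \rangle\, dt \\
&= \delta_{j,p-q} \sum_{n \ge 0} c_{n,j+n} \langle \phi_q | (I \otimes |n\rangle\langle j+n|)\, \phi_p \rangle
\end{aligned}
\]

for all $p, q \in \mathbb{N}$. Setting

\[ \phi_k = \sum_{r \ge 0} \phi_k^r \otimes |r\rangle \tag{19} \]

with $\phi_k^r \in \mathcal{H}$ such that $\sum_{r \ge 0} \|\phi_k^r\|^2 = 1$, the last equation becomes

\[ \langle q | E^{\Xi}(f_j) | p \rangle = \delta_{j,p-q} \sum_{n \ge 0} c_{n,j+n} \langle \phi_q^n | \phi_p^{j+n} \rangle. \]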

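This must be compared with

\[ \langle q | E_{\mathrm{can}}(f_j) | p \rangle = \delta_{j,p-q}. \]

The two expressions are the same if and only if

\[ \sum_{n \ge 0} c_{n,p-q+n} \langle \phi_q^n | \phi_p^{p-q+n} \rangle = 1 \qquad \text{for all } p \ge q \ge 0. \tag{20} \]

If $\{\eta_n\}_{n \in \mathbb{N}}$ are unit vectors in $\mathcal{H}$ such that $c_{n,m} = \langle \eta_n | \eta_m \rangle$ (see [5]), eq. (20) amounts to

\[ \sum_{n \ge 0} \langle \eta_n \otimes \phi_q^n \,|\, \eta_{p-q+n} \otimes \phi_p^{p-q+n} \rangle = 1 \qquad \text{for all } p \ge q \ge 0. \tag{21} \]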

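Since

\[ \sum_{n \ge 0} \|\eta_n \otimes \phi_k^{j+n}\|^2 = \sum_{n \ge 0} \|\phi_k^{j+n}\|^2 \le 1 \qquad \text{for all } j, k \in \mathbb{N}, \]

by the Cauchy-Schwarz inequality Eq. (21) holds if and only if

\[ \eta_n \otimes \phi_q^n = \eta_{p-q+n} \otimes \phi_p^{p-q+n} \ \ \forall n \in \mathbb{N} \qquad \text{and} \qquad \sum_{n \ge 0} \|\phi_q^n\|^2 = \sum_{n \ge 0} \|\phi_p^{p-q+n}\|^2 = 1 \]

for all $p \ge q \ge 0$. These two conditions are in turn equivalent to the following:

(1) $\phi_p^n = 0$ for all $0 \le n < p$, and $\sum_{n \ge p} \|\phi_p^n\|^2 = 1$;

(2) $\eta_n \otimes \phi_0^n = \eta_{p+n} \otimes \phi_p^{p+n}$ for all $n, p \in \mathbb{N}$.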

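If conditions (1) and (2) hold, let $n_0$ be such that $\phi_0^{n_0} \ne 0$. Condition (2) with $n = n_0$ then implies $\eta_{p+n_0} = \lambda_p \eta_{n_0}$ for all $p \in \mathbb{N}$, and thus $c_{n_0+p,\,n_0+q} = \overline{\lambda_p}\lambda_q$ for all $p, q \ge 0$.

If conversely there exist $n_0 \in \mathbb{N}$ and a sequence $\{\lambda_n\}_{n \ge n_0}$ such that $c_{m,n} = \overline{\lambda_m}\lambda_n$ for all $m, n \ge n_0$, then by the Cauchy-Schwarz inequality $\eta_n = \overline{\lambda_{n_0}}\lambda_n \eta_{n_0}$ for all $n \ge n_0$. Choose $\phi_q^n$ in Eq. (19) such that

\[ \phi_q^n = \delta_{n_0,\,n-q}\, \overline{\lambda_n}\lambda_{n_0}\, \phi \qquad \forall n \in \mathbb{N}, \]

with $\phi \in \mathcal{H}$ such that $\|\phi\| = 1$ (in the above formula, $\phi_q^n = 0$ if $n < n_0$). Then the sequence $\{\phi_q^n\}_{q,n \in \mathbb{N}}$ satisfies conditions (1) and (2) above. □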

A phase observable can be preprocessing clean without being extremal. Indeed, we have the following example.

Example 2. Let $n_0 \ge 1$. Define $c_{nm} = 1$ if $n \ge n_0$ and $m \ge n_0$, and $c_{nm} = \delta_{nm}$ (Kronecker delta) otherwise. Obviously, $(c_{nm})$ is a phase matrix and the corresponding phase observable $E$ is not unitarily equivalent to $E_{\mathrm{can}}$. As a consequence of Proposition 19 we conclude that $E \overset{\mathrm{PRE}}{\sim} E_{\mathrm{can}}$. However, in Proposition 5 we have shown that $E \overset{\mathrm{POST}}{\sim} E_{\mathrm{can}}$ holds if and only if $E$ and $E_{\mathrm{can}}$ are unitarily equivalent. Therefore, $E$ and $E_{\mathrm{can}}$ are not postprocessing equivalent. Note that $c_{m,n} \in \mathbb{R}$ for all $m, n$ and the rank of $(c_{nm})$ is $n_0 + 1$ (this can be seen easily by noting that the $\eta$-sequence of $(c_{nm})$ can be chosen such that the first $n_0 + 1$ vectors $\eta_n$ are mutually orthonormal and $\eta_n = \eta_{n_0+1}$ for all $n > n_0 + 1$), hence $E$ is not extremal by Prop. 14. Moreover, $E$ is approximately sharp by Prop. 12 and, thus, postprocessing clean.
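The rank claim in Example 2 is easy to confirm on a finite block of the phase matrix. The following is a sketch under stated assumptions: indices start at 0, the matrix is cut to a hypothetical `size` larger than $n_0$, and the rank is computed by exact Gaussian elimination over the rationals. The helper names are ours.

```python
from fractions import Fraction

def example2_matrix(n0, size):
    """Top-left size x size block: 1 on the tail block m, n >= n0 and on the
    diagonal, 0 elsewhere."""
    return [[Fraction(1) if (m >= n0 and n >= n0) or m == n else Fraction(0)
             for n in range(size)] for m in range(size)]

def rank(matrix):
    """Rank by Gaussian elimination with exact rational arithmetic."""
    rows = [row[:] for row in matrix]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

block = example2_matrix(3, 10)
# Condition (ii) of Proposition 19 with lambda_n = 1 on the tail m, n >= n0.
tail_is_constant = all(block[m][n] == 1
                       for m in range(3, 10) for n in range(3, 10))
```

For `n0 = 3` the block has rank 4, that is $n_0 + 1$: the first $n_0$ rows are independent unit rows and all later rows coincide. The constant tail is what makes the observable preprocessing equivalent to the canonical one.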

We have seen that $E_{\mathrm{can}}$ is optimal in all four ways. However, there are also other phase observables sharing this feature. This is illustrated in the following example.

Example 3. Let $\{f_1, f_2\}$ be the canonical basis of $\mathbb{C}^2$, and let $\eta_0 = f_1$, $\eta_1 = f_2$, $\eta_2 = 2^{-1/2}(f_1 + f_2)$, $\eta_3 = 2^{-1/2}(f_1 + i f_2)$, and $\eta_n = f_1$ for all $n \ge 4$. Define the phase matrix $c_{m,n} = \langle \eta_m | \eta_n \rangle$, and let $E$ be the associated phase observable.

It has been shown in [TTj that $(c_{m,n})$ is a rank 2 phase matrix which is extremal in the convex set of phase matrices, hence $E$ is extremal in the convex set $\mathcal{O}$. Moreover, $\lim_{m\to\infty} c_{m,m+k} = 1$ for all $k \in \mathbb{N}$, and therefore $E$ is approximately sharp by Proposition 12. It follows from Proposition 5 that it is also postprocessing clean. By Proposition 19 the observable $E$ is preprocessing clean. However, $E$ is not unitarily equivalent to $E_{\mathrm{can}}$ since the ranks of the associated phase matrices are different.
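The elementary properties of the phase matrix in Example 3 can be verified directly. The sketch below assumes the vectors $\eta_0 = f_1$, $\eta_1 = f_2$, $\eta_2 = 2^{-1/2}(f_1 + f_2)$, $\eta_3 = 2^{-1/2}(f_1 + i f_2)$ and $\eta_n = f_1$ for $n \ge 4$, and the convention that the inner product is antilinear in its first argument. It does not test extremality, which relies on the cited result.

```python
import math

R = 1 / math.sqrt(2)

def eta(n):
    """Vectors of Example 3 in the canonical basis (f1, f2) of C^2."""
    table = [(1 + 0j, 0j), (0j, 1 + 0j), (R + 0j, R + 0j), (R + 0j, R * 1j)]
    return table[n] if n < 4 else (1 + 0j, 0j)

def c(m, n):
    """Phase matrix element c_{m,n} = <eta_m|eta_n>."""
    return sum(a.conjugate() * b for a, b in zip(eta(m), eta(n)))

# Unit diagonal, as required of a phase matrix.
diagonal_ok = all(abs(c(n, n) - 1) < 1e-12 for n in range(10))
# Constant tail: condition (ii) of Proposition 19 with n0 = 4, lambda_n = 1.
tail_ok = all(abs(c(m, n) - 1) < 1e-12
              for m in range(4, 10) for n in range(4, 10))
# A genuinely complex entry, so the real-matrix criterion does not apply.
complex_entry = c(1, 3)
```

Since all $\eta_n$ lie in $\mathbb{C}^2$ and $\eta_0$, $\eta_1$ are orthonormal, the rank is exactly 2. The entry `complex_entry` equals $i/\sqrt{2}$, which shows that the phase matrix is not real-valued, so Proposition 14 does not contradict extremality here.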

Acknowledgments. The authors thank Jukka Kiukas and Kari Ylinen for valuable discussions.